Volume 4 -- Issue 10 July, 1984 


In This Issue... 


18-Digit Arithmetic, Part 3... 
Building Label Tables for DISASM 
Quick Memory Testing ...... 
68000 Sieve Benchmark. .... 
Updating the 6502 Prime Sifter 
Sorting and Swapping. . 
Apple //c Gotchas. . .... . 
Orphans and Widows ..... . 
Speed vs. Space. . . « « « « e 


eee © © © © 0 
oe e« © © © © eo 
ee e© © © © © © 
ee © © © © 8 8 le 
oe e © © © © 8 
oe © © © © © ee 
oe e © © © © 8 
ee « © © © 8 ele 
oe ee © © © © ele 
ee «© © © © © 8 
eo 28 © © © © © ee 
ee «© © © © ee 
eee © © © © © 

sc , 

© 


Feedback on our DOSonomy 


Our little dossier of DOS names was well received. It may be 
we will soon have so many names we will need a dosser (a large 
basket that can be carried on the back) to hold them all. On 
the other hand, if we keep writing about this our fortunes may 
reverse, forcing to finding new quarters in a doss house. What 
is the critical dosage? 


Dan Pote offers “Kinda-Sorta~DOS". Which led Bill to coin 
“MaybeDOS". Randy Horton reminded us of “Ante-DiluviDOS". 
Chris Balthrop enters MacroDOS and “What's Up DOS". (I think 
the latter is “Buggy”. Or "Bugsy"? Oh, it's not bunny 
anymore...) If you can take all this, you may be too docile. 


Don Lancaster Strikes Again 


We just have a little space and a little time to mention Don's 
new Assembly Cookbook for the Apple II/IIe, which just arrived. 
It looks like another winner! Look for a full review next 
month, or check our ad on page 3 for ordering info. 


18-digit Arithmetic, Part 3....eceeeeeeeeee-BOD Sander Cederlof 


Plowing ahead, this installment will offer the division and 
input conversion subroutines. 


You will remember that we covered addition and subtraction in 
the May 1984 issue, and multiplication in June. Now it's time 
for division, which completes the fundamental arithmetic 
Operations. All four of these routines are designed to operate 
on two arguments stored in DAC and ARG, leaving the result in 
DAC. Addition and subtraction leave “garbage” in ARG. 
Multiplication leaves ARG unchanged. Division leaves in ARG 
what was in DAC. 


Division is simple enough in concept, but no one would call it 
simple in implementation. “How many groups of X are in Y?* 
“If I deal an entire deck of 52 cards to 4 people, how many 
will each person get?” “If I scramble a dozen eggs, and serve 
them in equal-size portions to 7 people, how many eggs will 
each eat?“ (Really, I am good cook!) 


Suppose I have a pile of pennies, and want to find out how many 
dollars they represent. I will count out piles of 100 pennies, 
moving them into separate piles. Then I will count the little 
piles. Now, suppose I have two 18-digit numbers in my computer 
and want to divide the one in ARG by the one in DAC.... I will 
subtract the value in DAC from the one in ARG over and over, 
until I finally cross zero. Then if I was wise enough to count 
how many times I did the subtraction, I have the answer. 


Let's look at the problem in more detail now. What I want to 
do is divide the value in ARG by the value in DAC: 


numerator (in ARG) 
sete ems eiete ie aoe = quotient (in DAC) 


denominator (in DAC) 


Numbers in DP18 can be positive or negative, so we have to 
remember the rules of signed division. If the signs of the 
numerator and denominator are the same, the quotient will be 
positive; if they are different, the quotient will be negative. 


Numbers in DP18 are coded as 18-digit fractions with a power- 
of-ten exponent. Remembering algebra: 


ef * 10°m f 
salen = <--- * 10° (m-n) 
-g * 10°n g 


The 18-digit fractions are normalized so that there are no 
leading zeroes. That is, the value will either be all zero, or 
it will be between .1 and .999999999999999999 (inclusive). 


I think it is time now to start looking at the program. In the 
listing which follows there are references to subroutines and 
variables which we defined in the previous two installments of 
this series. 
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S-C Macro AsSembler Version L.iOwccccccccccccseccvccccesccccccccccses es 900 
S-C Macro Assembler VerSion lLoleccecccccvccccececevcvescscccccce see 9 I20 90 
VerSion l.l Update... .cccccccccccccccccccccccccecccescccscccscessce Gl 2.50 
Source Code for Version 1.1 (on two disk SideS).WN.cccccvccccccccceee SlO0 
Full Screen Editor for S-C Macro (with complete source code) .........-$49 
S-C Cross Reference Utility (without source Code). .ccrcccccccesceseeee 920 
S-C Cross Reference Utility (with complete source COde) ...ecccceeeee ee $50 
DISASM Dis-Assembler (RAK-“Ware) cccccccccscccccccccccccccsccececceseese GSU 
Source Code for DISASM..cccccccccccccscccccccccccccccvcsescadditional $30 


S-C Word Processor (with complete SOUrCe COdE)...cccccrcccccecesceseee GIO 
Double Precision Floating Point for Applesoft (with source code)......$50 
S-C Documentor (complete commented source code of Applesoft ROMs).....$50 
Source Code of //e CX & FB ROMS On GiBk..ccccccccrecrcccccccecccccsses SLO 


(All source code is formatted for S-C Macro Assembler Version 1.1. Other 
assemblers require some effort to convert file type and edit directives.) 


AAL Quarterly DVEK 650 6 oS ws £48 oe ea RA Owe aw ween CAEN $15 
Each disk contains all the source code from three issues of “Apple 
Assembly Line", to save you lots of typing and testing time. 

QD#1: Oct-Dec 1980 QD#2: Jan-Mar 1981 QD#3: Apr-Jun 1981 
QD#4: Jul-Sep 1981 QD#5: Oct-Dec 1981 QD#6: Jan-Mar 1982 
QD#7: Apr-Jun 1982 QD#8:; Jul-Sep 1982 QD#9: Oct-Dec 1982 
QD#10: Jan-Mar 1983 QD#1]11: Apr-Jun 1983 QD#12: Jul-Sep 1983 
QD#13: Oct-Dec 1983 QD#14: Jan-Mar 1984 QD#15: Apr-Jun 1984 


AWIIe Toolkit (Don Lancaster, SynergeticsS) ...cccccccccccccccccccccseee S39 
Quick-Trace (Anthro-Digital) ...cccrcccccccccccccccccccccee (reg. $50) $45 
Visible Computer: 6502 (Software Masters) .......ccccccceee (reg. $50) $45 
ES-CAPE: Extended S-C Applesoft Program Edit Or....cccccccccccccccvecee 900 
Amper-Magic (AnNthro-DiGgital ) ss sds dios <seiis 6 Seews 66 ee 6ee LEOGs $75) $65 
Amper-Magic Volume 2 (Anthro-Digital) evrecveeveeeeenvseeeoevecen {reg. $35) $30 
Routine Machine (Southwestern Data Systems) ....e.eeeee- (reg. $64.95) $60 
“Bag of Tricks", Worth & Lechner, with diskette......ccccccee ($39.95) $36 
FLASH! Integer BASIC Compiler (Laumer Research) ....ccccceccccccccessee9l9 
Fontrix (Data Transforms) é.cice:s <ci6.0 wwe 0 00.0.4 006 600.0 oS 4s HOw ee We een eee 1S 
Aztec C Compiler System (Manx Software) .....cccccccceceee (reg. $199) $180 


Blank Diskettes (Verbatim)........eee.2.90 each, or package of 20 for $45 
(Premium quality, single-sided, double density, with hub rings) 

Vinyl disk pages, 6"x8.5", hold two disks each.....cccccccccccee lO For $6 
Diskette Mailing Protectors (hold 1 or 2 disks).............40 cents each 

or $25 per 100 

These are cardboard folders designed to fit into 6°X9" Envelopes. 

Envelopes for Diskette MalilerS....cccccccccccsccccccccscccee 6 Cents each 
ZIF Game Socket Extender (Ohm Electronics) .cccccccccccccccccccvceesese S20 


Books, BOOKS, BOOKS... .ccccccccccsccsecvcceeeeCOMpare OUr discount prices! 
"Apple jJ{ Circuit Description’, Gayler...cccccccccccccceee ($22.95) $21 
“Understanding the Apple II", Sather...ccccccccccccccece ($22.95) $21 
“Enhancing Your Apple II, vol. 1", LancaSter......ccceee ($15.95) $15 

Second edition, with //e information. 
“Assembly Cookbook for the Apple II/IIe", Lancaster.....($21.95) $20 
“Incredible Secret Money Machine®, Lancaster.....cceccccee ($72.95) $7 
"Beneath Apple DOS", Worth & Lechner.....cccccccscccecee ($19.95) $18 
“Assembly Lines: The Book", Roger Wagne€r...cccccscccccee ($19.95) $18 
“What's Where in the Apple", Second Edition............-($19.95) $19 
“What's Where Guide” (updates first edition) .....ccccceee ($9.95) $9 
"6502 Assembly Language Programming", Leventhal........-($18.95) $18 
"6502 Subroutines", Leventhal... .ccccccsecsccccccvccseccse ($17.95) $17 
"Real Time Programming -- Neglected Topics", Foster......($9.95) $9 


We have small quantities of other great books, call for titles & prices. 
Add $1.50 per book for US shipping. Foreign orders add postage needed. 


*** S-C SOFTWARE, P. O. BOX 280300, Dallas, TX 75228 *** 
baal (214) 324-2050 wee 
*** We accept Master Card, VISA and American Express *** 
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Line 4250 swaps the contents of ARG and DAC. I did it this way 
because it leaves something possibly useful in ARG after the 
division is finished. If you wanted to form the reciprocal 
quotient, DAC=DAC/ARG, you can enter at DDIVR, which skips the 
swapping step. 


Lines 4260-4270 check for the illegal case of division by zero. 
If I divide something into zero-size parts, I get an infinite 
number of these parts. That's fine, but the DP18 has no 
representation for infinity; therefore we say it is illegal to 
Givide by zero, just like Applesoft does. Some computers and 
some software arithmetic packages do represent infinity, but 
DP18 does not. Zero values are represented by having an 
exponent byte of zero, so we only have to check one byte here. 


Lines .4280-4310 form the sign of the quotient. This is the 
same as lines 1280-1310 of the DMULT listing given last month, 
and so we could make them into a subroutine. The subroutine 
would take 10 bytes, and the two JSR's make another 6. That's 
16 bytes, against the 18 bytes for the two versions of in-line 
code. Saves a total of 2 bytes, at a cost of adding 12 cpu 
cycles to both multiply and divide. (Small digression into the 
kind of trade-offs I am continually making....) 


Lines 4330-4390 compute the exponent of the quotient, and check 
for overflow and underflow cases. The special case of the 
numerator being zero is also caught here, line 4350. Line 4380 
restores the bias of $40. Bias? Remember, the exponent is 
kept in memory with $40 added to it, so that the range -63 
through +63 is represented by $01 through $7F. 


If the new exponent is still in the range $00 through S7F, we 
will go ahead and do the division. If not, the quotient is 
either too small (underflow) or too large (overflow). For 
example, 10°-40 / 10°40 results in 10°-80, which is too small 
for DP18. Lines 4410-4470 catch these cases, and change the 
quotient to zero. If the new exponent is between $80 and SBF, 
it represents 10°64 or larger, and so we call on the Applesoft 
OVERFLOW error. 


Lines 4500-4550 set up the loop which does the actual division 
of the fractions. The 6502's decimal mode will be used during 
this loop. Ten bytes in MAC (defined in DMULT last month) will 
be used to hold the quotient until we are through with DAC. 

The X-register will be used to count out the 20 digits. The 
other end of the loop is in lines 4920-4930, where X is 
decremented and tested. 


The body of the loop is really a lot simpler than it looks. 
Basically, ARG is subtracted from DAC until DAC goes negative. 
The number of subtractions is counted in MAC+9. Then ARG is 
added back to DAC to make it positive again, and MAC+9 
decremented. The result is a quotient digit in MAC+9, and a 
remainder in DAC. One extra digit is needed, extending DAC on 
the left end. This digit is carried in the stack. See it 
pushed at line 4710, pulled at line 4790. 
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After each digit of the quotient is determined, both MAC and 
DAC are shifted left one digit place. This might shift a 
significant digit out of DAC (the remainder), so it is lifted 
out first and saved on the stack (lines 4570-4630). If the 
first two digits of the remainder (happen to be "00", then we 
know without subtracting that the quotient digit in this 
position will also be “0O", (Remember that the leading digit of 
the denominator in ARG is NEVER zero.) This fact can speed up 
divisions, so it is tested for at line 4580, with lines 
4670-4680. 


After all 20 digits are formed, the loop terminates. Line 4950 
then returns us to binary mode. Line 4960 adds one to the 
quotient exponent, adjusting for the normalization step. 

(.9/.1 = 9, but we want to represent it as .9*10°l.) If the 
exponent now is negative ($80), it may be still in range if the 
leading digit of the quotient is zero (.1/.9 = O.11ll...). 

This test takes place at lines 4970-5000. 


Lines 5020-5060 copy the quotient from MAC to DAC. These are 
the same as lines 1330-1370 in DMULT, so they could be made 
into a subroutine. Two other candidates for subroutines are 
lines 4720-4780, which are identical to lines 1680-1740 of DADD 
(May 1984); and lines 4830-4890, which are the same as 
1530-1590 of DADD. 


Finally, DDIV exits by jumping to NORMALIZE.DAC. 


Doesn't all this take a lot of time? You bet it does! I timed 
it in the full DP18 package with a program that looked like 
this: 


&DP:INPUT X(0) : INPUT X(2) 
FOR I = 1 TO 100 

&DP:X(4) = x(0)/X(2) 

NEXT 


I determined the loop overhead by entering a value zero for 
X(0). Since this case skips around nearly everything in DDIV, 
I called its time the loop overhead time. After subtracting 
out the loop overhead, the times look like this: 


0/anything 0 
x/x 12 msec 
1/9=.1111... 23 msec 
8/9=.8888... 49 msec 
1/7=.142857... 35 msec 


It looks like the maximum time, which would be for a quotient 
with all 20 digits = 9, would be about 53 msec. The average 
time, about 35 msec. This compares with an average Applesoft 
9-digit division time of about 7 msec. 


4250 DDIV. JSR SWAP.ARG.DAC ...CHANGE TO DAC = DAC/ARG 
4260 DDIVR LDA ARG.EXPONENT CHECK FOR ZERO DENOMINATOR 
4270 BEQ .2 ~++X/0 IS ILLEGAL 
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#--—FORM SIGN OF QUOTIENT -------- 


DA T EXTENS 
SEE IF FIRST TWO Tt 


DIVISION BY ZERO ERROR 


i? aLsd ZERO 


4390 LDA DAC.SIGN 

4300 EOR ARG. SIGN 

4310 STA DAC. SIGN 

4320 ®---COMPUTE EXPONENT OF QUOTIENT- 
43h0 LDA DAC. EXPONENT 

4350 BEQ . -. . 0/Xe0 

4360 SBC ARG.EXPONENT 

4370 CLC 

4380 ADC #$40 ADJUST OFFSET 
4390 STA DAC. EXPONENT 

4400 #---CHECK OVER/UNDERFLOW--------- 
4410 BPL .3 ». NELTHER 
4420 ASL SEE WHICH... 
4430 BPL .1 +» OVERFLOW 
WHO .0 LDA #0 | "TTDNDERFLOW, SET KESULT = 0 
4450 STA DAC.SIGN 

4460 STA DAC. EXPONENT 

4470 RTS 

4480 04 JMP AS.OVRFLW 

4490 : IMP AS.ZRODIV 

4500 &--SET UP QUOTIENT LOOP--------~ 
4510 . SED DECIMAL MODE 
4520 LDA #0 

4530 STA MAC+9 CLEAR FIRST QUOTIENT D10J7 
46 LDX #20 DO 20 DIGITS 
4550 BNE .5 ..e ALWAYS 
4560 ®---CONTINUE QUOTIENT LOOP------- 
W570 «4 LDA DAC. 

4580 PHP SAVE ZERO STATUS 
4590 LSR 

4600 LSR 

4610 LSR 

4620 LSR 

46 30 PHA DAC LEFT EXTENSION 
4640 JSR SHIFT.DAC.LEFT. ONE 

4650 SSh SHIFT.MAC. LEFT. ONE 
4660 PLA C LEF 

4670 

680 

4690 

4700 

4710 

4720 

4730 

4740 


PLP 
BEQ .9 nesYES, 80 QUOTING 
#---SUBTRACT WHILE POSSIBLE------ 
5 INC MAC+9 COUNT be eaTENST ON” 
PHA DAC LEFT EXTENS! 
SEC DO A TRIA 
LDY #9 
7 LDA DAC.HI,Y 
4750 SBC ARG. HI, 
4760 STA DAC.HI,Y 
4770 DEY 
4780 BPL .7 
4790 PLA DAC LEFT EXTENSION 
4800 SBC #0 
4810 BCS .5 NO BORROW 
4820 #---OVERSHOT, SO RESTORE--------- 
48 30 LDY #4 BORROW,SO ADD IT HACK IN 
4840 CLC 
4850 .8 LDA DAC.HI,Y 
4860 ADC ARG.HI,Y 
4870 STA DAC.HI,Y 
4 DEY 
4890 BPL .8 
4900 DEC MAC+9 BACK OFF QUOTIENT DIUIT, TOO 
4910 #---NEXT DIGIT------------------- 
4920 .9 DEX ALL DIGITS? 
4930 BNE .NOT YET, KEEP UOINQ 
4ON0 #---ADJUST EXP, CHECK OVERFLOW--- 
4950 CLD BINARY MODE 
4960 INC DAC.EXPONENT ADJUST FOR OFWIT 
4970 BPL .10 . eNO OVERFLOW PROM 
4980 LDA MAC COULD BE OVERFLOW 
4990 AND #$F0 
5000 BNE . . » OVERFLOW 
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Copyrtyht (C) 8-C SOFTWARE 


Automatically date 
stamps files with 
PRO-DOS 


THE NEW TIMEMASTER I! 


, NEW 1984 

} DESIGN 
An official 

PRO-DOS Clock 


WATT ens. 
Just plug it in and your programs can read the year, month, date, day, 
and time to 1 millisecond! The only clock with both year and ms. 
NiCad battery keeps the TIMEMASTER It running for over ten years. 
Full emulation of ALL other clocks. Yes, we emulate Brand A, Brand T, 
Brand P, Brand C, Brand S and Brand M too. It's easy for the 
TIMEMASTER to emulate other clocks, we just drop off features. 
That's why we can emulate others, but others CAN’T emulate us. 


the nearest millisecond. 


Apple Peripherals Are All We Make 


That’s Why We’re So Good At It! 


Super Music Synthesizer 
Improved Hardware and Software 


eA ot et 
a 


af bey } 
Complete 16 voice music synthesizer on one card. Just plug it into 
your Apple, connect the audio cable (supplied) to your stereo, boot 
the disk supplied and you are ready to input and play songs. 
It’s easy to program music with our compose software. You will start 


@ The TIMEMASTER II will automatically emulate the correct clock card right away at inputting your favorite songs. The Hi-Res screen shows 
for the software you're using. You can also give the TIMEMASTER Ila what you have entered in standard sheet music format. 
simple command to tell it which clock to emulate (but you'll like the @ Now with new improved software for the easiest and the fastest 
Timemaster mode better). This is great for writing programs for those music input system available anywhere. 
poor unfortunates that bought some other clock card. We give you lots of software. in addition to Compose and Play 

@ Basic, Machine Code, CP/M and Pascal software on 2 disks! programs, 2 disks are filled with over 30 songs ready to play. 

@ Eight software controlled interrupts so you can execute two programs @ Easy to program in Basic to generate complex sound effects. Now 
al the same time (many examples are included). your games can have explosions, phaser zaps, train whistles, death 

@ Orboard timer lets you time any interval up to 48 days long down to cries. You name it, this card can do it. 


Four white noise generators which are great for sound effects. 


The TIMEMASTER I includes 2 disks with some really fantastic time onented 
programs (over 4Q) including appointment book so you'll never forget to do anything 
again. Enter your appointments up to a year in advance then forget themy, Plus DOS 
dater so it will automatically add the date when disk files are Created of modified. The 
disk ts over a $200.00 value alone--we give the software others sell. All software 
packages for business, data base management and communications are made to 
tead the TIMEMASTER U1. If you want the most powerful and the easiest to use Glock 
for your Apple. you want a TIMEMASTER IL 
PRICE $129.00 


Plays music in true stereo as well as true discrete quadraphonic. 
Full control of attack, volume, decay, sustain and release. 

Will play songs written for ALF synthesizer (ALF software will not take 
advantage of all our card’s features. Their software sounds the same 
in our synthesizer.) 

Our card will play notes from 30HZ to beyond human hearing. 
Automatic shutoff on power-up or if reset is pushed. 


Many many more features. PRICE $159.00 


Viewmaster 80 


There used to be about a dozen 80 column cards for the Apple, now 
there’s only ONE. 


Z-80 PLUS! 


‘a ” 
- —.: 


aa fac @ TOTALLY Videx Compatible. 
Co ae @ 80 characters by 24 lines, with a sharp 7x9 dot matrix. 
a] ae @ On-board 40/80 soft video switch with manual 40 column override 
Wi O80: allel @ Fully compatible with ALL Apple languages and software—there are 
. -— NO exceptions. 
@ Low power consumption through the use of CMOS devices. 
ms @ All connections are made with standard video connectors. 
. @ Both upper and lower case characters are standard. 
nd tee only ; tee nae : FC Pn dca co @ All new design (using a new Microprocessor based C.R.T. controller) 
: : for a beautiful razor sharp display. 
@ Fully compatible with microsoft disks (no pre-boot required). @ The VIEWMASTER sscoaporales suike features of all other 80 column 
@ Specifically designed for high speed operation in the Apple Ile (runs cards, plus many new improvements. 
just as fast in the 11+ and Franklin). : os BR ah Sagas 
@ Runs WORD STAR, dBASE I!, COBOL-80, FORTRAN-80, RT Semen redeem” “Optaion” coaritan 
PEACHTREE and ALL other CP/M software with no pre-boot. VIEWMASTER =—-179 VES YES «so ydS—(isES—VES ESE ves 
@ A semi-custom 1.C. and a low parts count allows the Z-80 Plus to fly SUP RTE RAS ee . oF ae ies ae a ee 
: a "S is WIZARD BO iN SO : : 
ee at avery low power level. (We use the Z-80A at ae isi oe <a uc te aS Re ae es 
, ‘ : N© NO YES YES 
@ Does EVERYTHING the other Z-80 boards do, plus Z-80 interrupts. Sead ie Sad. aa, eae, a ta 
SMAKTER MORE vis is NO axe) NO YES Yts NO 
Don't confuse the Z-80 Plus with crude Copies of the nucrosoftcard. The aes MORI < Le YES wes va WS NO vES 


2-80 Plus employs amuch more sophisticated and reliable design, With 
the Z-80 Plus you can access the largest body of software in existence. 
Two computers in one and the advantages of both, all at an unbelievably 


low price. PRICE $139.00 


The VIEWMASTER 80 works with all 80 column applications including CP/M, 
Pascal, WordStar, Format HH. Easywriter, Apple Writer t!, VisiCalc, and all 
others. The VIEWMASTER 80 is THE MOST compatible 80 column card you 


can buy at ANY price! PRICE $179.00 


@ Expands your Apple tle to 192K memory. MemoryMaster tle 128K RAM Card 
e Provides an 80 column text display. @ Precision software disk emulation for Basic, Pascal and CP/M is 
@ Compatible with all Apple tle 80 column and extended 80 column available at a very low cost. NOT copy protected. 
Card software (same physical ee Apple's 64K card). @ Documentation ing luded, we show you how to use all 192K. 
@ Can be used as a solid state disk drive to make your programs ran up Ifyou already have Apples 64K card. just order the MEMORYMASTER te with 64K and use 
to 20 times FASTER (the 64K configuration will act as half a drive). the 646 from your old board to give you a lull 128K. (The board ws fully socketed so you 
@ Permits your Ile to use the new double high resolution graphics. sunply plug in more Chips ) 
@ Automatically expands Visicale to 95 K storage in 80 columns! The MemoryMaster Ile with 128K $249 
64K config. is all that's needed, 128K can take you even higher. Upgradeable MemoryMaster tle with 64K $169 
@ 


PRO-DOS will use the MemoryMaster lle as a high speed disk drive. Non Upgradeable MemoryMaster Ile with 64K $149 


Our boards are lac supenortto most of the Consumer electrons made today ALLL CG (sate in high quality sockets with mikypec components used throughout PC boards are glass: 
epoxy with gold Contacts Made in Amenca to be the bestin the world All products workin the APPLE TIE TL TE and Franklin The MemoryMastcr ig ts He only Appled tagineenng 
also manufactures a tulllne of data a Quistbon and Control products forthe Apple: A/D Converters and digital I/O cards, etc Please call formore iatotmation Allour products ate tully 
tested with complete documentation aod available tor immediate detivery All products ate gusrantced with a no hassle THREE VEAR WARRANTY. 


Call (214) 492-2027 

8 am. to 11 p.m. 7 days a week 
MasterCard, Visa & C.0.D. Welcome 
No extra charge for credit cards 


Send Check of Money Order to: 
APPLIED ENGINEERING 
P.O. Box 798 
Carrollton, TX 75006 


Texas Residents Add 5% Sales Tax 
Add $10.00 if Outside U.S.A. 
Dealer Inquiries Welcome 
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5010 erica oe gee TO DACo------ == 
-10 Y #9 


5030 .11 CDA MAC ,Y 

5040 STA DAC. HI,Y 

5050 DEY 

5060 BPL .11 

207? JMP NORMALIZE.DAC 

5080 @scscecelecc5 loose ce seeeeeueee 


5100 
at40 SHIFT .MAC .LEFT.ONE 
LDY 


5120 #4 
2130 1 ASL MAC+ 
5140 ROL MAC+ 
5150 ROL MAC+ 
5160 ROL MAC+ 
ott? ROL MAC+ 
5180 ROL MAC+ 
5190 ROL MAC+ 
5200 ROL MAC+ 
5210 ROL MAC+1 
5220 ROL MAC 
2830 DEY 

52 BNE .1 
52 RTS 


5960 @nceecccleccneel ose ose Soe cle 


DP18 Input Conversion 


The input conversion subroutine processes characters from 
memory to produce a value in DAC. This is analogous to what 
the equivalent subroutine in Applesoft ROMs does. 


It is so analogous, in fact, that I even depend upon CHRGET and 
CHRGOT to fetch successive characters from memory. It is a lot 
faster than Applesoft conversion, however, because it is BCD 
coded rather than binary. This means that, stripping away the 
frills such as sign, exponent part, and decimal point, it even 
easier than an ASCII to hex conversion. 


Of course, we need all those frills. Look ahead to the program 
listing which follows: Lines 1200-1220, just those three 
little lines, handle the conversion of digits. All the rest of 
the page is for frills! Well, to be honest about it, two of 
the three lines call subroutines, but still, the frills 
predominate. 


The acceptable format of numbers is basically the same as that 
which normal Applesoft accepts. A leading sign is optional. 
The numeric portion can be more than 20 digits long, but only 
the first 20 will be accumulated (not counting leading zeroes). 
A decimal point is optional anywhere in the numeric portion. 

An exponent part can be appended to the numeric portion, and 
consists of the letter “E", and optional sign, and one or two 
digits. The exponent can be up to 8l, just so the final number 
evaluates between .1*10°-63 and .9999...9*10 63. Numbers 
smaller than .1*10°-63 will be changed to zero, and numbers 
larger than .9999...9*10°63 will cause an OVERFLOW ERROR. 


Looking at the program, lines 1040-1080 clear a working area 
which comprises DAC and four other variables: SGNEXP, EXP, 
DGTCNT, and DECFLG. SGNEXP will be used to hold the sign of 
the exponent part; EXP will hold the value of the exponent 
part; DGTCNT will count the digits in the numeric portion; and 
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DECFLG will flag the occurrence of a decimal point. DAC 
includes DAC.SIGN. Note that the X-register will be left with 
SFF, which fact is important at line 1170 below. 


Lines 1090-1100 preset the DAC.EXPONENT to $40, which indicates 
10°0. This will be incremented along with DGTCNT until a 
decimal point is encountered. 


Lines 1110-1180 handle the optional leading sign. DAC.SIGN has 
already been cleared above, indicating the positive case. Ifa 
minus sign is in front of the number, line 1170 sets DAC.SIGN 
negative. Note that calling CHRGOT and CHRGET to retrieve 
characters automatically eliminates (ignores) blanks. 
CHRGOT/CHRGET also checks whether the character retrieved is a 
digit or not, and indicates digits by carry clear. If the 
first non-blank character is a digit, we immediately jump to 
the numeric loop at line 1200. If not, the subroutine FIN.SIGN 
checks for a + or - character. The + or - may or may not be 
tokenized, depending on whether the string is from an INPUT 
statement or is a constant embedded in a program, so we have to 
check for both the character and the token form of both signs. 
FIN.SIGN handles this checking. 


If that first character is neither a digit nor a sign, it may 
be a letter "E"“ or a decimal point; so, we go down to lines 
1240-1270 to check for those two cases. If neither of these 
either, we must be at the end of the number. If it is a 
decimal point, lines 1630-1650 record the fact that a decimal 
point was found and also check whether this is the first one 
found or not. If the first, back we go to continue looking for 
digits. If not the first, it must be the end of the number, so 
we fall into the final processing section at line 1670. 


Exponents are more difficult, because the value actually must 
be converted from ASCII to binary. Lines 1290-1610 do the 
work, including handling of the optional sign, and range 
checking. 


Lines 1670-1730 compute the final exponent value. This is the 
number of digits before the decimal point (not counting any 
leading zeroes you may have typed to confuse me) plus the 
exponent computed in the optional “E“ field. If the result is 
negative, between SCO and SFF, it indicates underflow; in this 
case, the value is changed to zero. If there were no non-zero 
digits in the numeric portion, the value is set to zero 
regardless of any "E“ field. If the resulting exponent is 
between $80 and SBF, it indicates OVERFLOW. 


Lines 1840-2130 accumulate individual digits. DGTCNT is used 
to index into the nybbles of DAC, and the digit is stored 
directly into place. Leading zeroes on the numeric field are 
handled here (lines 2090-2120). Leading zeroes before a 
decimal point are entirely ignored, while leading zeroes after 
a decimal point cause the DAC.EXPONENT to be decremented. The 
incrementation of DAC.EXPONENT for each significant digit on 
the left of the decimal point is also taken care of here (lines 
2020-2070). 
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This complete the third installment of DP18. 
the way to a working subset of the entire package. 


We are well on 
We still 


need output conversion and some sort of linkage to Applesoft 
before we can begin to see it all run. The entire DP18 package 


really exists, and works, now. It includes 


PRINT USING, 


very 


fancy input screen handling, full expression parsing, and all 
the math functions. Several of you have been very anxious to 
get the whole package for use in projects of your Own, SO we 

have offered a source code license to DP18 on an “as is" 


for only $200. 


1000 SOAVE S.DP18 FIN 


1010 @eessecs cl cccc wesc es cocesccas 

1020 ®  DP18 INPUT CONVERSION 

1040 FIN LDA #0 CLEAR WORK AREA 

1050 LDX #WRKSZ-1 (DAC, SGNEXP, EXP 
1060 «1 STA WORK, x DGTCNT, & DECFLG) 
1080 BPL LEAVE X=$FF WHEN FINISHED 
1090 LDA #40 

1100 STA DAC. EXPONENT 

1110 ®---HANDLE LEADING SIGN---------- 

1120 JSR AS.CHRGOT 

1130 BCC .2 IF DIGIT 0-9 

1140 JSR FIN-SIGN ..eSEE IF + OR - SIGN 
1160 BCC .3 ck 

1170 STX DAC.SIGN ...-, SET TO $FF 

1180 BCS “a ySe YS 

1190 ®---GET DIGITS TILL NON-DIGIT---- 

1200 .2 JSR ACCUMULATE.DIGIT 

1210 3 JSR AS.CHRGET GET NEXT CHARACTER 
1220 .«DIGIT 

1230 ®on=",") SE", OR END=------------ 

120 .4 dup ¢!, DECIMAL POINT? 

1250 BEQ .9 YES 

1260 P #'E LETTER E 

1270 BNE .10 END OF NUMBER 

1280 #---HANDLE EXPONENT FIELDessese== 

1290 JSR AS.CHRGET 

1300 BCC -.-DIGIT, ASSUME POSITIVE 
1310 JSR FIN.SIGN ...SEE IF + OR - SIGN 

1 30 BCC <5 ee, 

1340 ROR SGNEXP ...-, SO SET SGNEXP NEGATIVE 
1350 .5 JSR AS.CHRGET ‘Se |T FIRST DIGIT OF EXP 
1360 BCS .8 .»NO DIGITS! 

1 70 .6 AND #$0F . ISOLATE EXP 1ST DIGIT 
1380 STA EXP 

1390 JSR AS.CHRGET GET 2ND DIGIT OF EXP, IF ANY 
1400 BCS .8 ...NO MORE DIGITS 

1410 AND #$0F ISOLATE 2ND DIGIT 

1420 PHA SAVE ON STACK 

1430 LDA EXP MULTIPLY 1ST DIGIT BY 10 
140 ASL 

1450 ASL (CLEARS CARRY TOO) 

1460 ADC EXP & 

1470 ASL #10 (CARRY STILL CLEAR) 
1480 STA EXP ADD 2ND DIGIT 

1490 PLA 

1500 ADC EXP 

1510 STA EXP 2 DIGIT EXP 

1520 CMP #64418 ALLOW ,9000000000000000 15 +82 
12ho JSR ig. cancer cet Reet en R 

1550 BCS . NO MORE DIGITS 

1560 7 JMP AS.OVRFLW OVERFLOW ERROR 

1570 : ASL SGNEXP CHECK SIGN oF EXP 

1580 BCC .10 »«sPOSIT 
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1590 LDA #0 .. NEGATIVE, SO COMPLEMENT EXP 
1600 SBC EXP 

1610 JMP .11 .. ALWAYS 

1620 #.--FOUND DECIMAL POINT---------- 

1 30 9 ROR DECFLG SET DECIMAL POINT FLAG 
1640 BIT DECFLG CHECK FOR TWO DECIMAL POINTS 
1650 BVC .3 NO 

1660 #---COMPUTE FINAL EXPONENT------- 

1 70 .10 LDA EXP GET EXPLICIT EXPONENT 
1680 .11 CLC 

1690 ADC DAC.EXPONENT 

1700 LDX DGTCNT SEE IF ANY SIGNIFICANT DIGITS 
1710 BNE .12 .. YES 

1720 TXA ...NO, MAKE EXPONENT ZERO 
1730 .12 STA DAC.EXPONENT 

1740 TAX TEST RANGE OF EXPONENT 
1750 BMI .13 .«eNOT IN RANGE 0...7F 
1760 RTS 

1770 #..-EITHER UNDER- OR OVER-FLOW--- 

1780 .13 += ASL R OVER? 

1790 BCC .7 .» OVERFLOW 

1800 0 

1810 STA DAC. SIGN 

1820 BEQ .12 .. ALWAYS 

1880 ACCUMULATE.DIGIT 

1850 AND #$0F ISOLATE DIGIT 

1860 BEQ . ZERO DIGIT 

1870 TAX SAVE DIGIT IN X-REG 
1880 LDA DGTCNT NO MORE THAN 20 SIGNIFICANT DIGITS 
1890 CMP #20 

1900 BCS .2 DISCARD EXTRA DIGITS 
1910 #---STORE THE DIGIT IN DAC------- 

1920 LSR ODD/EVEN TO CARRY 

1 30 TAY INDEX TO Y-REG 

19h0 TXA GET DIGIT FROM X-REG 
1950 BCS .1 ODD DIGIT ON RIGHT SIDE 
1960 ASL EVEN DIGIT MUST BE SHIFTED 
1970 ASL 

1980 ASL 

1990 ASL 

2000 .1 ORA DAC.HI,Y MERGE 

2010 STA DAC.HI,Y 

2020 #---COUNT THE DIGIT-------------- 

2030 .2 INC DGTCNT COUNT SIGNIFICANT DIGIT 
2040 LDA DECFLG SEE IF IN FRACTION 

2050 BMI .3 YES 

2060 INC DAC.EXPONENT NO 

2070 .3 RTS 

2080 SoseDIGIT = Osoaseeeecccese les cus 

2090 .4 LDA DGTCNT SEE IF LEADING ZERO 
2100 BNE .2 NO 

2110 LDA DECFLG SEE IF PART OF FRACTION 
2120 BPL .5 NO, COMPLETELY IGNORE IT 
2130 DEC DAC. EXPONENT 

2140 .5 RTS 

9150 seen as ooo bese cesta wecceuecosie 

2160 & SCAN + OR - SIGN 

2180 + .EQ., .CC. 

2190 # _ EQ. 9 e ) 

2200 # OTHER .NE. 

So1 Ov eseeeo coo eet a cn sosbeneecesesesse 

2220 FIN.SIGN 

2230 CMP ¢'-! 

2240 BEQ .2 

2250 CMP #TKN.MINUS 

2260 BEQ .2 

2270 CMP #'+! 

2280 BEQ .1 

2290 CMP #TKN. PLUS 

2300 .1 CLC 

2310 .2 RTS 
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Building Label Tables for DISASM...........BOb Sander-Cederlof 


RAK-Ware's DISASM has the nice feature of being able to used a 
list of pre-defined labels when you are disassembling a block 
of code. I needed to turn the //c monitor ROM (SF800-SFFFF) 
into source code, and Apple sent me a list of all their labels 
in this area. 


The format of the label table, or name table, is very simple. 
Each entry takes eight bytes: the first two are the value, 
high byte first; the remaining six are the label name, in ASCII 
with high bit set. If the name is less than six characters 
long, zeroes are used to fill out the entry. 


Very simple to explain, but how do you enter things like that 
in the S-C Macro Assembler? The example on the DISASM disk 
does it this way: 


1000 ~HS FDED 
1010 ~AS —/COUT/ 
1020 eHS 00000000 
1030 ~HS FDFO 
1040 eAS -—/COUT1/ 
1050 ~HS 000000 


and so on. 


That works, but it is so error prone and time wasting that I 
gave up before I started. However, there is an easy way uSing 
macros and abbreviations. 


Start by defining a macro which will build one entry: 


1000 eMA LBL 

1010 eHS }1 

1020 eAS -/]2/ 
1030 -BS *+7/8*8-* 
1040 « EM 


The macro is named LBL, and will be used like this: 


1050 >LBL FDED,COUT 
1060 >LBL FDFO,COUT1 


Line 1030 is the tricky one. This .BS will add just enough 
zeroes to an entry to pad it out to an even multiple of 8 
bytes. Now, assuming the origin started at an even multiple of 
8, and assuming you are writing the table on a target file, 
that macro builds the kind of entries DISASM wants. Instead of 
just assuming, lets add: 


0900 ~OR $4000 
0910 -TF B.NAMETBL 


I also mentioned abbreviations above. I even get tired of 
typing "tab>LBL ", you know. Usually when I have a lot of 
lines to type that have a common element, I use some special 
Character that is easy to type and not present in the lines I 
plan to type. Then after all the lines are in, I use the 
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REPLACE command to substitute the longer string for the 


Single-character abbreviation I have used. 


1050 .FDED,COUT 
1060 .FDFO,COUTI 
et cetera 

and after many lines type 


REP /./ >LBL /1050,A 


Thus, I can type: 


I was about up to FA90O when it dawned on me that I could break 
the symbols into blocks within a page, and include the page 


value in my abbreviation: 


1050 .ED,COUT 
1060 .FO,COUTI1 


REP /./ >LBL FD/1050,A 


With all these shortcuts, I was able to enter over 400 label 
names and definitions in less than an hour. 


Let the computer work FOR you! 


We Oo 


S-C Macro Assembler. The key to unlocking all the 
mysteries of machine language. Combined editor/ 
assembler with 29 commands, 20 directives. Macros, 
conditional assembly, global replace, edit, and more. 
Highest rating “The Book of Apple Software” in 1983 
and 1984. $80. 


Powerful cross-assembler modules also available to 
owners of S-C Macro Assembler. You can develop soft- 
ware on your Apple for 6800, 6805, 6809, 68000, 8085, 
8048, 8051, 1802, LSI-11, and Z-80 microprocessors. 
$50 each. 


S-C Xref. A support program which works with the S-C 
Macro Assembler to generate an alphabetized listing of 
all labels in a source file, showing with each label the 
the line number where it is defined along with all line 
numbers containing references to the label. You get the 
complete source code for this amazingly fast program, 
on disk in format for S-C Macro Assembler. $50. 


Full Screen Editor. Integrates with the built-in line- 
oriented editor in the S-C Macro Assembler to provide 
a powerful full-screen editor for your assembly lan- 
guage source files. Drivers for Videx, STB80, and Apple 
//e 80-column boards are included, as well as standard 
40-column version. Requires 64K RAM in your Apple. 
Complete source code on disk included. $50. 


Complete, Commented Source Code! 


O ft t onl locked and fully copyable 
hacks tien provide. the complete eGurcE. code on disk, at unbelievable prices! 


S-C Docu-Mentor for Applesoft. Complete documenta- 
tion of Applesoft caternali Using your ROM Applesoft, 
produces ready-to-assemble source code with full 
labels and comments. Educational, entertaining, and 
extremely helpful. Requires S-C Macro Assembler and 
two disk drives. $50. 


S-C Word Processor. The one we use for manuals, 
letters, our monthly newsletter, and whatever. 40- 
columns only, requires lower-case display and shiftkey 
mod. Works with standard DOS text files, but at super 
fast (100 sectors in 7 seconds). No competition to 
WordStar, but you get complete source code! $50. 


Apple Assembly Line. Monthly newsletter published 
since October, 1980, for assembly language pro- 
grammers or those who would like to be. Tutoria! arti- 
cles, advanced techniques, handy utility programs, 
and commented listings of code in DOS, ProDOS, and 
the Apple ROMs. Helps you get the most out of your 
Apple! $18/year. 


S-C SOFTWARE CORPORATION 
2331 Gus Thomasson, Suite 125 
Dallas, TX 75228 (214) 324-2050 


Protessional Apple Software Since 1978 
Viea, MasterCard, American Express, COO accepted. 
Apple 3 8 lracemna’t o' Apple Compule inc 
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Quick Memory TeSting....ccccccccccccceeseeeBOD Sander-Cederlof 


What do you do when a friend brings his Apple over with an 
intermittent memory failure? You KNOW you have a memory test 
program somewhere, but WHERE? 


Here is a quick way to test normal RAM between $7D0 and SBFFF. 
(RAM in //e hYperspace or banked into ROM space is another 
Matter.) Turn on your friend's computer, and hit reset to 
abort the booting sequence. We don't need or want DOS around 
while we are testing memory. Type HOME and CALL-151 to get 
into the monitor. Then type the following monitor command: 


*¥N 7D0:00 N 7D1<7D0.BFFEM 7D1<7D0.BFFEV 
7D0:55 N 7D1<7D0.BFFEM 7D1<7D0.BFFEV 
7DO:AA N 7D1<7D0.BFFEM 7D1<7D0.BFFEV 
7DO:FF N 7D1<7D0.BFFEM 7D1<7D0.BFFEV 
34:0 


The "*" is the monitor prompt, so don't you type it. There are 
no carriage returns in the line above, it just wraps around the 
40-column screen that way. There must be one trailing blank 
after the "34:0" at the end. This makes the monitor repeat the 
whole command line forever. 


I started the test at $7D0 so there will be some visible 
feedback, but most of the screen will stay clear. If you begin 
testing at a lower address, any errors displayed on the screen 
might be overwritten as soon as they show up. 


When you type the RETURN key you will see a line of inverse 
at-signs at the bottom of the screen. After a few seconds, 
this will change to flashing U. Then *, and then some other 
Character, depending on what kind of Apple you have. Then the 
cycle will start over again. 


Until a memory error is detected. Any error will cause two 
lines to be printed, giving the address before the error with 
its contents and the contents of the error byte, and the 
address of the error byte with its actual contents and 
Should-be contents. For example, if you were in the "AA" 
phase, and $8123 came up with SAB, you would see: 


8122-AA (AB) byte before error 
8123-AB (AA) error byte 


If any error lines start printing, note which bit is bad and 
which 16K bank it is in. Then you can point directly to the 
bad chip. 


7 6 5 4 3 2 1 «0 
7D0...3FFF C10 C9 C8 C7 C6 C5 C4 C3 
4000...7FFF D10 D9 D8 D7 D6 D5 D4 D3 
8000...BFFF E10 E9 E8 E7 E6 ES E4 E3 


Page 14..... Apple Assembly Line.....June, 1984......Copyright (C) S-C SOFTWARE 


Appleseed is a complete computer system. It is designed using the bus 
conventions established by Apple Computer for the Apple ][{. Appleseed is 
designed as an alternative to using a full Apple ][ computer system. The 
Appleseed product line includes more than a dozen items including CPU, 
RAM, EPROM, UART, UNIVERSAL Boards as well as a number of 
other compatible items. This ad will highlight the Mother board. 


BX-DE-12 MOTHER BOARD 


The BX-DE-12 Mother board is designed to be fully compatible with all of 
the Apple conventions. Ten card slots are provided. Seven of the slots are 
numbered in conformance with Apple standards. The additional three 
slots, lettered A, B and C, are used for boards which don’t require a 
specific slot number. The CPU, RAM and EPROM boards are often placed 
in the slots A, B and C. 


The main emphasis of the Appleseed system is illustrated by the Mother 
Board. The absolute minimum amount of circuitry is placed on the Mother 
Board; only the four ICs which are required for card slot selection are on 
the mother board. This approach helps in packaging (flexibility & smaller 
size), cost (buy only what you need) and repairability (isolate and fix pro- 
blems through board substitution). 


Appleseed products are made for O.E.M.s and serious industrial/scientific 
users. Send for literature on the full line of Appleseed products; and, watch 
here, each month, for additional items in the Appleseed line. 


Appleseed products are not sold through computer stores. 
Order direct from our plant in California. 


Apple is a registered trademark of Apple Computer, Inc. 


DOUGLAS ELECTRONICS 
718 Marina Blvd., San Leandro, CA 94577 © (415) 483-8770 
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68000 Sieve Benchmark... ..ccccccccsccesceeeee Peter J. MCInerney 
New Zealand 


Here are two versions of the Sieve of Eratosthenes for the 
MC68000. They provide ample justification for the power 
Claimed for this chip. 


The first version is a fairly straightforward translation of 
the algorithm as presented in the November 1982 AAL, by Tony 
Brightwell. Tony's best time in the 6502 was 183 seconds for 
1000 repetitions; in my 12.5 MHz DTACK GROUNDED attached 
processor, 1000 repetitions took only 40 seconds. 


Compare the 68000 code with the 6502 code, and I'm sure you 
will agree the 68000 version is much easier to understand. 
Note the use of long instructions in the array clearing loop 
and the two-dimensional indexing in lines 1230 and 1310. Other 
nice things are the shift left by 3 (multiply by 8) in line 
1270 and the decrement & branch instructions in lines 1120 and 
1400. Also very useful is the postincrement address mode, 
which automatically increments the address kept in the 
referenced register by l, 2, or 4 depending on the size of the 
operation. This is used for popping off (downward growing) 
stacks or aS here to advance through memory. There is also a 
predecrement mode but I did not use it in these example 
programs. 


The second version uses a modified algorithm. The changes I 
Made should apply to the 6502 version also, improving it in 
about the same proportion. 


* Since we are ignoring even numbers, we may as well leave 
them out of the array entirely, thus halving the array size. 


* We can therefore simplify the formula for odd squares from 
S*8+1 to S*4, 


* We can even do away with the *4 part by adding 4 each time 
rather than l. 


* The initial array clearing loop can be made faster by using 
more than one CLR instruction per loop. 


This modified version does 1000 iterations in only 33 seconds! 
It is only slightly harder to follow than the first version, 
and only slightly larger. In fact, if we forego the final 
modification above, the code is actually shorter. I think most 
of the speedup comes from halving the array size. 


If you have a Macintosh, and can manage to load machine code 
into it, you should find everything running about half as fast 
as my DTACK GROUNDED board. 


[ We tried the program on our QWERTY Q-68 board, and it took 
roughly 10 times as long as Peter's DG board. Understandable, 
Since it was using Apple memory at .5MHz rate for all work. 
(Bill&Bob) ] 
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1000 ®SAVE SIEVE OF ERATOSTHENES. 1 
1020 8 ® CODED BY PETER J. MCINERNEY, NEW ZEALAND 
john .OR $3800 
00004000- 1050 ARRAY BQ $4000 
00003800- 3C3C 03E7 1070 SIEVE MOVE #999 D6, DO 1000 TIMES 
1080 ®--—-CLEAR WORKING ARRAY---------- 
00003804- 307C 4000 1090 .1 MOVE aARRAT AO CLEAR ARRAY FROM 
00003808- 03¢ OFFF 1100 MOVE #grrr, BO 000 TO $7FFF 
0000380C- 42 1110 .2 CLR.L 
0000380E- 51C8 FFFC 1120 DBF DO,.2 
1130 #---INIT VARIABLES--------------- 
00003812- 700 1140 VEQ #3,D0 TA 3 
000038 14- 720 1150 MOVEQ #1,D1 SUM OF ODD NUMBERS 
00003816- 7401 1160 MOVEQ #1,D2 COUNT OF ODD NUMBERS 
00003818- 7601 1170 MOVEQ qi D3 USED FOR STRIKING NON-PRIMES 
000038 1A- O7¢ 4000 1180 MOVE Y,AO START OF ARRAY 
000038 1E- 60 1190 BRA.S JUMP INTO LOOP 
1200 #---START SIFTING---------------- 
00003820- 5242 1210 .3 ADDQ  § #1,D2 COUNT ODD NUMBERS 
0000 B22- D2u2 1220 ADD D2,D1 GET SUM OF ODDS 
00003824- 0C30 0000 
00003828- 000 1230 A CMPI.B #9, ,0(A0,DO) is THIS A PRIME? 
0000382A- 661 12 BNE.S 
1290 ®---STRIKE OUT MULTIPLES--------- 
0000382C- 3801 1260 MOVE D1,D GET 8°S+1 = NEN 
0000382E- E744 1270 ASL 43, 
000038 30- 5244 1280 ADDQ = #1,D 
900038 32- Aco 1290 MOVE  DO,D ONLY STRIKE ODD MULTIPLES 
00003834- £3k5 1300 ASL #15D 
000038 36- 1183 4000 1310 .5 MOVE.B D3,0(A0,D4) STRIKE ONE 
000038 3A- DBs 1320 ADD -D TRIKE, 
900038 3¢- Oc4h 4000 1330 cMPr ¢s4000,D4 ++ FINISHE 
00003840- 63F4 1340 BLS fick 
00003842- 5h40 S Peal a SIEVE SIZE NEXT ODD 
og036u OCAO OO7F 1370 curt #127 ,D0 7° Ranh Bib p Ont 


Q @---DO IT ALL 1000 TIMES-------- 
0000384A- 51CE FFB8 


1 
848- 63D6 : 0 

1400 DBF DO,. XT TIME 
O00038KE- 4E75 1 RTS 


may aoa 


#---D0 it ALL 10 0 aa 
DEF RT Tie 


QO3B4AR S40 
S000 uC- eee 0O7F 
O0003850- 63E0 


SORORREE= RUSE FPmO 


NOOO @IH 


Oooo0o°0o 


1000 ®SAVE SIEVE OF ERATOSTHENES. 2 
1020 ® CODED BY PETER J. MCINERNEY, NEW ZEALAND 
1086 R $7800 
00004000- 1030 ARRAY "Ba 000 
00003800- 3C3C 03E7 1070 SIEVE MOVE #99 By DO 1000 TIPE 
a 1050 #---CLEAR WORKIN poeeuecce 
00003804- 307C 4000 1090 .1 MOVE MARRAC A N Vy] 
00003808- 3036 OOFF 1100 MOVE SFr, D pb ae tty, An V4 
0000380C- 42 1110 .2 CLR.L 0)+ 
09003805 499 1120 CLR.L (A0)+ 
00003810- 429 1130 CLR.L (A0)+ 
00003812- 4298 1140 CLR.L (A0)+ 
000038 14- 4298 1190 CLR.L (A0)+ 
000038 16- 42 1160 CLR.L (A0)+ 
000038 18- 429 1170 CLR.L (A0)+ 
000038 1A- 4298 1180 CLR.L CAO $ 
0000381C- 51C8 FFEE 1190 ; oa i 
003820- 7003 1210 MOVEQ TAT i 
80003829- hy een a ONDA TO Y 
00003826- 7601 13h6 ’ i Gk me na wa NON - PI DOK 
00003828- 307C 4001 1250 i reat 4 is Ne 
Bo ooat age ray ue is ° Varat tert ANAAL 
. 1260 e-—-START ‘grPTt ior Bomoseen 
00003832- 5842 1290 .3 } y pal , 
00036 he D842 1300 ADD. 21) ypiare niet NEN 
9000 : 6- 016 0000 1310 .4 CMPL .B AO)+ IN ihn 
mckleaenc 1340 @---STRIKE OUT MULTI PLEA 
0000383C- 3A04 1340 MOVE ge ven i TL 
000038 E- 1383 5000 0 5 MOVE. .B OD Bgl »Dd Mut if 
0000384 he Och 2000 1370 CMPI B3h0 0,D5 apa ' 
0000384 8- 63F 
1 
1 
1 
1 
1 
1 


frre: 


E75 


wi 
© 
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Updating the 6502 Prime Sifter.............BOob Sander-Cederlof 


I spent a half day applying Peter's algorithm improvements to 
the November 1982 6502 version, and refining the program as 
much as I could. It now runs in 175 milliseconds per 
iteration, or 1000 iterations in 175 seconds. Still way behind 
the 68000, of course. On the other hand, a 6MHz 6502, with 
fast enough RAM for no wait states, would be faster than a 12.5 
MHz 68000. And it remains to be seen what the 65802 could do. 


In the process of running various versions and various tests, I 
discovered that the innermost loop, at lines 1820-1850, is 
executed 10277 times. This means that, while marking out the 
odd non-primes between 1 and 16383, a total of 10277 such marks 
are made. Since only odd numbers are assigned slots in the 
working array, giving only 8192 such slots, you can see that 
some numbers get stricken more than once. These are the 
numbers with more than one prime factor. The most-stricken 
number is 3*5*7*11*13 = 15015, which gets five strikes. The 
loop takes 11 cycles as written, and I don't see any way to 
shorten it any further or to reduce the number of times it is 
used. Do you? 


The loop time is 11*10277 = 121297 cycles, or about 120 msec 
out of the total 175. The array clearing accounts for another 
41 msecs, leaving only 14 msec for all the rest of the program. 
Not bad! 


Here is a little Applesoft program which will make a nice neat 
listing of primes from the working array, assuming it runs from 
$6000 through S$7FFF. 


100 HIMEM :24576 
110 FOR A = 24576 TO 32767 
120 IF PEEK (A) = 0 THEN 


PRINT RIGHTS(" “+STRS ((A-24576) *2+1,7) 3: 
N=N#t1 
130 IF N = 10 THEN PRINT : N = 0 
140 NEXT 
1010 8SAVE S.SUPER-FAST PRIMES IMPROVED 
1920 -OR $8000 SAFELY OUT OF WAY 
6000- 1086 BASE .EQ $6000 BASE OF PRIME ARRAY 
FF3A- 1050 BEEP .EQ $FF3A BEEP THE SPEAKER 
00- 1060 SQZzzzZ .EQ 0,1 
02- 1070 START .EQ 2 
Oh. en COUNT .EQ 4,5 
1100 MA ZERO. 8©=©=——t—~COCOCS 
1110 STA ]1+$000,X 
1120 STA ]1+$100,X 
1130 STA ]1+$200,X 
1140 STA J1+ 300,x 
1190 STA ]1+$400,X 
1160 STA ]1+$500,X 
1179 STA ]1+$600,X 
1180 STA J1+$700,X 
1190 .—M 
1200 Seseeese ec cee ceececcee sols ceesece 
1210 # MAIN CALLING ROUTINE 
1220 # 


Page 18.....Apple Assembly Line.....June, 1984......Copyright (C) S-C SOFTWARE 


B01 - Ae 00 


co 
E 
t 
Ono 
Ve] 
oo 
oo 


fo] 
=) 
w 
© 
‘ 
2) onl 
See SSBBR skese 
>) 
Ls) 


FF 


60 


80 
80 


FF 


80 


1290 


Ooooqooecooo0eo 


2 See es 
© 


COI AUN EW) N= OO CO QU BW A —*§ OW OQ Sw © 


C009 0909.09.09 0900.00 COIS SII IIIT ANAHAAHRAAAACANMNINIASIONIG & SSS SSL SS SWWWW Ww Wwwwa) 


CO] QU 22) IO 


BAD PAD ce ced cad ce cath ceed we ceed ced ed cad ed ed ad ed end and enh onb nb ed oe oh eh eh etd ee th ot cee ed oe eth od et oh ed et od ed adh ed od th wd od nt oe ed oh ed et ot od ds sd ds oss as ond od 2 2b od = od 


OC OOOO WOWOO01010 


=o 
oo 


MAIN LDA #-100 DO 1000 TIMES SO WE CAN MEASURE 
STA COUNT THE 


A TIME IT TAKES 
LDA /-10 
STA COUNT+1 
JSR BEEP ANNOUNCE START 
1 JSR PRIME 
INC COUNT 
INC COUNT+1 
. JMP BEEP SAY WE'RE DONE 
# PRIME ROUTINE 
# SETS ARRAY STARTING AT BASE 
a TO $FF IF NUMBER IS NOT PRIME 
& CHECKS ONLY ODD NUMBERS > 3 
& INC = INCREMENT OF KNOCKOUT 
; N = KNOCKOUT VARIABLE 
PRIME 
LDX #0 
TXA CLEAR WORKING ARRAY 
1 >ZERO BASE 
>ZERO BASE+$0800 
>ZERO BASE+$1000 
>ZERO BASE+$1800 
P BNE .1 NOT FINISHED CLEARING 
LDA /BASE+4 POINT AT FIRST PRIME-SQUARED 
STA SQZZZZ+1 (WHICH IS 3439) 
LDA #BASE+4 
STA SQZZZZ 
LDA #1 POINT AT FIRST PRIME (3) 
P BNE .4 ~e ALWAYS 
.2 TXA 
ASL 
ASL 
ADC SQZZZZ 
STA SQZZZZ 
BCC . 


INC Sdzz2201 


3 LDA ae GET A POSSIBLE PRIME 


BNE THIS ONE HAS BEEN KNOCKED OUT 
P TXA 
74 STA START 

ASL INC = START#2 + 1 

ADC #1 

STA 


~T+1 
LDA SQZZZZ+1 MOVE MULT TO N 
STA .64+2 
LDA SQZZZZ 


5 TAX 
BEQ .9 ~ee SPECIAL CASE FOR X20 
#~-—-STRIKE OUT MULTIPLES--------~- 
6 STA pergOex REMEMBER THAT N IS REALLY AT . 64 
‘ ae $8 N = N + INC 
Bee 6 DONT'T BOTHER TO ADD, NO CARRY 
INC .6+42 INC HIGH ORDER 
‘ BPL .5 ~eeNOT FINISHED 
LDX START GET OUR NEXT KNOCKOUT 
8 INX POINT AT NEXT ODD NUMBER 
CPX #64 UP TO 127 
BCC .2 WE'RE DONE IF X>127 
. RTS 
9 LDA .6+2 
STA .10+2 
~10 STA $FFO0 
TXA 
BEQ .7 ~-e ALWAYS 
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Sorting and Swapping... .cecececccceeseeeeeeBOD Sander-Cederlof 


Jack McDonald, writing in the July 1984 Software News, posed a 
puzzle for programmers: using nothing more than a series of 
calls to a SWAP, sort five items into ascending order. SWAP 
compares two items according to the indexes supplied, and 
exchanges the items if they are out of order. For example, 
calls on SWAP which follow the pattern of a “Bubble Sort” would 
look like this: 


SWAP (1,2) SWAP (1,2) SWAP (1,2) SWAP (1,2) 
SWAP (2,3) SWAP (2,3) SWAP (2,3) 

SWAP (3,4) SWAP (3,4) 

SWAP (4,5) 


That is ten swaps, which is more than necessary. You can do it 
in nine, which was McDonalds Puzzle. He gave an answer, and I 
found another. It was fun writing some quick code to test 
various swap-lists. 


First I wrote a macro named "S“ which loaded the two index 
numbers into X and Y, and called a subroutine named SWAP. See 
it in lines 1030-1070. 


Then I coded SWAP (lines 1200-1290), which compared two bytes 
at BASE,X and BASE,Y; if they were out of order, I Swapped them 
around. To make things easy for me, I put BASE at $500, which 
just happens to be the third line on the video screen. That 
way I could watch everything happen without struggling to code 
I/O routines. 


I wrote a program which would initialize a 5-byte string to all 
$01 (no program, really just a data definition at line 1670); 
another which copies the string to BASE (LOAD, lines 
1590-1650); another which counts up from 0101010101 to 
0505050505, so that all possible combinations would be run 
through (NEXT, lines 1770-1870); and another to do all these in 
connection with SORT, which performed a list of SWAP calls. 

The result was a method for visualizing and checking various 
groups of SWAPs to see if they could sort any initial 
permutation into ascending order. Assemble, and type MGO NEXT 
to see it all work. 


Here is the code, with two possible SWAP orders which work, of 
nine steps each. 


Ng. a ana ora 
One LDX ab 

1050 LDY #j2 

1060 JSR SWAP 

Hehe EM 

1080 #2... nn renee eee owen nnn 
1090 e-MA INC 

1100 INC PERM+ }1 

1110 LDA PERM+ ]1 

7120 CMP #6 

et BCC : 

1140 LDA #1 

1150 STA PERM+]1 

1160 3:1 

1170 EM 
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---------------------- APPLE SOFTWARE ----------------------- 


NEW!!! FONT DOWNLOADER & EDITOR ($39.00) 

Turn your printer into a custos typesetter. Downloaded characters reaain active while printer is powered. Can be used 
with every word processor capable of sending ESC and control codes to the printer. Switch back and forth easily between 
standard and custoa fonts. All special printer functions (like expanded, coapressed, eaphasized, underlined, etc.) apply 
to custoa fonts. Full HIRES screen editor lets you create your own custoe characters and special graphics syebols. 
Coapatible with aany ‘duab’ & ‘seart’ printer I/F cards. User driver option provided. Specify printer: Apple Dot Matrix 
Printer, C.Itoh B510A (Prowriter), Epson FX-80/100 or Okidata 92/93. 


DISASM 2.2e - AN INTELLIGENT DISASSEMBLER ($30.00) 

Investigate the inner workings of aachine language prograas. DISASH converts 6502 sachine code into aeaningful, ayebolic 
source. Creates a standard DOS 3.3 text file which is directly coepatible with DOS ToolKit, LISA and &-C (4.0 and MACAO) 
asseablers. Handles data tables, displaced object code & even lets you substitute your oun aeaningful labels. (100 
cosaonly used Monitor & Pg Zero pg naaes included.) An address-based cross reference table provides further insight into 
the inner workings of eachine language prograes. DISASM is an invaluable sachine language learning aid to both the 
novice & expert alike. GOURCE codes $640.00 


S-C ASSEMBLER (Ver4.0 only) SUPPORT UTILITY PACKAGE (6.0.00) 
# SC.XREF - Generates a GLOBAL LABEL Cross Reference Table for coaplete docusentation of source listings. Formatting 
control accosodates all printer widths for best hardcopy outputs. # SC.GSR - Global Search and Replace elieinates 
teadious aanual renasing of labels. Search all or part of source. Optional proapting for user verification. ¢ OC. TAB 
Tabulates source files into neat, readable fore. GOURCE code: #40.00 


~-------------------- HARDWARE/FIRMWARE ------ 


THE ‘PERFORMER’ CARD ($39.00) 

Plugs into any Apple slot to convert your ‘duab’ centronics-type printer I/F card into a ‘saart one. Coeaand esau 
provides easy access to printer fonts. Eliainates need to reaeaber coeplicated ESC codes and key thee in tu setup 
printer. Added features include perforation skip, auto page nuebering with date & title. Also includes large WINS 
graphics screen duep in noraal or inverse plus full page TEXT screen duap. Specify printer: Epson Al Q0 with 
Graftrax-80, M¥-100, MX-80/100 with GraftraxPlus, NEC 80923A, C. Itoh 8510 (Prowriter), Okidata O2A/B3A with Obigraph & 
OpkiData 92/93. Oki bonus: print EMPHASIZED & DOUBLE STRIKE fonts!) GOQURCE codes $30.00 


FIRMWARE FOR APPLE-CAT: The ‘MIRROR’ ROM ($25.00) 

Cosaunications ROM plugs directly into Novation’s Apple-Cat Modes card. Three basic eodes: Dueb lerainel, Aoante ‘eneole 
& Prograssable Modes. Added features include: selectable pulse or tone dialing, true dialtone detection, audible ring 
detect, ring-back option and built-in printer buffer. Supports aost 80-coluan displays and the | wire shill boy aud 

Uses a superset of Apple‘s Come card and Microsodea IJ comands. A-C hardware differences prevent 100% cuapalibility 
with Coma card. SOURCE code: $60.00 


RAM/ROM DEVELOPMENT BOARD ($30.00) 

Plugs into any Apple slot. Holds one user-supplied 2Kx8 seaory chip. Use a 6114 type RAM chip for prvgree develnpeent or 
just extra aesory. Plug in a preprograased 2716 EPROM to keep your favorite routines ‘on-line . A veruelile board with 
gany uses! Haps into $(n00-CnFF and $CQ00-CFFF semory space. Circuit diagrae included. 


NEW! ! 5 SINGLE BOARD COMPUTER KIT ($20.00) 

Kit includes etched PC board (with solder eask and plated thru holes) and asseably instructione. Veer providoe AO) CPU 
6116 2K RAN, 6821 dual 68-bit [/0 and 2732 4K EPROM plus eisc comson parts. Originally designed ou intelligent printer 
interface - easily adapted to sany applications needing dedicated controller. (Asseabled and touted: 119 


All assembly language SOURCE code is fully coasented & provided in both S-C Asseabler & atanderd (FIT tweet é 
Apple DOS 3.3 diskette. Specify your systea configuration with order. Avoid a $3.00 postage and handling tharye hy 
enclosing full payaent with order (MasterCard & VISA excluded). Ask about our products for the Vil ond (waande a 64 


RAK -WARE 
41 Ralph Road West Orange NJ 07052 (208) 1mm 
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1180 
1190 # SWAP (X,Y) 
1200, Bxpeseeo reer ss eseseesssecadsces 
0800- BD 00 05 1210 SWAP LDA BASE,Xx 
0803- D9 00 05 1220 CMP BASE,Y 
tt San {rn es ty PHA 
0809- B9 00 05 1250 LDA BASE ,Y 
OBOF. @8 > 1570 ty aenanioe 
08 10- 99 00 05 1360 STA BASE,Y 
0813- 60 1290 1 RTS 
1310 ® SORT BY SWAPS 
1290 Baoctoceceseee ceo acece coe eee 
1 30 SORT 
1340 .DO 0 CHANGE TO 1 TO SELECT MCDONALD'S LIST 
1350 S 4,5 MCDONALD'S ORDER 
1360 SS t 
1 70 S$ 3; 
1380 ss 152 
1390 8 1,4 
1400 S$ 1, 
1410 >S 2, 
30 38 3°3 
14ho . ELSE 
08 14- 1450 $ 1,4 MY ORDER 
08 1B- 1460 dS 2,5 
0822- 1470 S 1,3 
(in ee 
08 37- 1500 >S 1,2 
| nn nee 
a 9 
084C- 1530 38 5 
0853- 60 1590 . RTS 
0500- 1570 BASE EQ $500 
O854- A2 05 1390 LOAD LDX #5 COPY PERM LIST TO BASE ON SCREEN 
0856- BD 63 08 1600 .1 LDA PERM, 
0859- 9D 00 05 1610 STA BASE,X 
085C- 9D 80 05 1620 STA BASE+128 ,X 
O85F- CA 16 30 DEX 
0860- DO F4 1640 BNE .1 
0862- 60 1650 RTS 
0863~ 00 01 01 lc 
0866- 01 01 01 1670 PERM .HS 000101010101 
0869- A2 O4 1690 CHECK LDX #4 CHECK IF LIST IS SORTED 
086B- BD 01 05 1700 .1 LDA BASE+1,X 
O86E- DD 00 05 1710 CMP BASE, 
0871- 90 0 1720 BCC . 
0873- CA 1730 DEX 
O87h- DO F5 1740 BNE .1 
0876- 60 1750 62 RTS 
0877 - 1770 NEXT >INC 5 INCREMENT PERM LIST 
0886- 90 45 1780 BCC .1 EACH BYTE RANGES FROM 
0888- 1 90 SINC 4 01 TO 05 
0 2 - 90 34 1800 BCC .1 
0849- 1810 INC 3 
O8A8- 90 23 1820 BCC .1 
O8AA- 1830 INC 2 
O8B9- 90 12 18 0 BCC .1 
BEE ooo sgt 
o8cc- 60 1870 RTS FINISHED 
O8CD- 20 54 0 18 0.1 JSR LOAD COPY PERMLIST TO SCREEN 
O8D0- 20 14 08 1890 JSR SORT SORT IT ON THE SCREEN 
08D3- 20 69 08 1900 JSR CHECK CHECK IF SORTED 
O8D6- BO OF 1910 BCS NEXT ...SORTED, TRY NEXT SEQUENCE 
O8D8- 60 1920 RTS ». NOT SORTED 
1030. Wace oe scent eee cece cows nesses 
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I also got interested in permutation generation, and came up 
with the following macros and code to generate all 120 
permutations of five items, without any extra steps, each step 
being the simple interchange of two items. Assemble, and type 
MGO PERMUTE to see it generate 120 strings of the letters ABCDE 
in different arrangements. 


1940 .MA SS 
1950 LDX #]1 
1960 LDY #12 
1970 JSR EXCHANGE 
1980 EM 
1990 #ieceas- cca n sect eee ee cecss seuss 
2000 EXCHANGE 
O8D9- BD 63 08 2010 LDA PERM, X 
O8DC- 48 2020 PHA 
O8DD- B9 63 08 2030 LDA PERM, Y 
O8E0- 9D 63 08 20K0 STA PERM, X 
O8E3- 68 2050 PLA 
O8EL- 99 63 08 2060 STA PERM,Y 
O8E7- A201 2070 LDX 
O8E9- BD 63 08 2080 .1 LDA PERM,X 
O8EC- 09 C 2090 ORA #$CO 
O8EE- 20 ED FD 2100 JSR $FDED 
O8F1- £8 2110 INX 
O8F2- E0 06 2120 CPX #6 
O8F4- 90 F3. 2130 BCC . 
O8F6- A9 AO = 210 LDA #$A0 
O8F8- 20 ED FD 2150 JSR $FDED 
O8FB- 60 2160 RTS 
DIO" Baeee te eaeeceelee eee eon eu oes 
2180 MA S3 
2190 >SS 1,2 
2200 28S 1,3 
2210 SS 1; 
2220 SS 1,3 
2230 SS 1,2 
22 . EM 
9250) Weseees coca eek ace eeaeeeececeel 
2260 .MA S4 
2270 S83 
2280 JSR $FD8E 
2290 SS 1,4 
2300 83 
2310 JSR $FD8E 
2320 SS 2,4 
2330 ©6583 
23h0 JSR $FD8E 
2350 SS 3,4 
2360 S83 
2370 JSR $FD8E 
2380 : 
2300 Sacesee ose le overs eo 
2400 PERMUTE 
O8FC- A205 = 2410 #5 
O8FE- 8A 2420 .1 TXA 
O8FF- 9D 63 08 2430 STA PERM,X 
0902- y DEX 
0903- DO F9 = 2450 BNE .1 
2NGO Seseeoen cacce ae uee cee en Ewecusus 
0905- 2470 >SS 1,1 
090C- 2480 Ss 
09B9- 2490 >SS 1,5 
09c0- 2500 os 
OA6D~ 2510 SS 1,5 
OAT4~ 2520 Sh 
0B21- 2530 SS 1,5 
0B28~ 2540 OS 
OBD5- 2550 288 1,5 
OBDC- 25 >S 
2510) Becneac sve see eee eee detest 
0C89- 60 2580 RTS 


Apple Assembly Line.....July, 1984......Copyright (C) S-C SOFTWARE.....Page 23 


Our //c came in, and we love it. However... 


The //c package does not include any DOS 3.3 master. 

Everything is ProDOS. Of course you do get a DOS 3.3 with most 
software you purchase. And of course ProDOS includes a disk 
copier that is supposed to be able to copy DOS 3.3 disks when 
you need to back up your DOS~-based software. However... 


The ProDOS disk copier which is being shipped with the //c has 
a serious bug. When you are copying a DOS-based disk it 
ignores the volume number recorded on the source disk, and 
forces the copy to be volume 254. That is fine if the source 
just happened to be volume 254 also, but chances are it isn't. 
I have many disks around here which are volume 1. The DOS 
image and the VTOC both think the disk copied by //c ProDOS is 
volume 1, but RWTS discovers it is volume 254 and refuses to 
cooperate any further. 


I guess the solution is to use the old faithful COPYA from your 
DOS 3.3 System Master. Since that doesn't come with a //c 
system, we are including licensed copies of COPYA and FID on 
our Macro 1.1 disks now. | 


More gotchas.... Apple decided it was time to rewrite large 
chunks of the monitor. Necessarily so, because the 
disassembler now has to cope with 27 new opcodes and address 
modes. The removed four entries from the monitor command 
table, and changed its starting point. This throws off the "$* 
command in the S-C Macro Assemblers, all versions. 


If you have Macro 1.1, the //e version is the one you should be 
running in your //c. You can fix the “$* command with these 
patches: 


$1000 $D000 old new 
version version value value 
$147B $D47B $17 $13 
$1486 $D486 SCC SCD 
$148B SD48B $15 $11 


A more elegant patch is possible, which automatically adjusts 
for whether you are ina //e or //c. If you want this, and 
have a 1.1 version prior to serial # 675, send us $5 for an 
update. 


We have tried RAK-Ware's DISASM 2.2e on our //c, and it works 
fine. It even picks up the 27 new opcodes and address modes 
automatically, because DISASM links to the monitor 
disassembler. Older versions of DISASM will not run ona //e 


or //c. 
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OrphansS and WidOWS..ccccccccccccsceeseeeeeeBOb Sander-Cederlof 


James, a brother of Jesus Christ, wrote: “Pure religion and 
undefiled before God and the Father is this, to visit the 
fatherless and widows in their affliction, and to keep himself 
unspotted from the world." (chapter 1, verse 27, King James 
Version) 


Of course, he was referring to real life and to real people 
with real needs, but it still serves to introduce this little 
announcement. 


"Orphans" and “widows" are also terms used in word processing 
to describe the lamentable situation of one line of a paragraph 
being left all alone on one page, while the rest is on another 
page. If that one line is the last line of a paragraph which 
won't quite fit, “she* is forced to the top of the next page, 
and iS a widow. If the lonely line is the first line of a 
paragraph, dwelling at the bottom of a page, bereft of the rest 
of its family on the following page, he or she is indeed an 
orphan. 


High class word processors give you the option of automatically 
"visiting" orphans and widows “in their affliction“. Thanks to 
Bobby Deen, this feature is now (as of June 29th) included in 
the S-C Word Processor (whether high class or not). When the 
feature is selected (by the "“!orl“ directive), orphans get 
moved to the next page and widows get squeezed onto the current 
page. 


Bobby is also working on, and he says it is now functional but 
somewhat unfinished, a version that fully uses the 80-column 
display on the Apple //e. We already had 80-column preview, 
but he is developing 80-column text display during edit/entry 
mode. 


Don Lancaster's AWliIie TOOLKIT 


Solve all of your Applewriter™ IIe hassles with these eight diskette sides 
crammed full of most-needed goodies including ... 


Patches for NULL, shortline, IIc detrashing, full expansion 
Invisible and automatic microjustify and proportional space 
Complete, thorough, and fully commented disassembly script 
Detailed source code capturing instructions for custom mods 
Clear and useful answers to hundreds of most-asked questions 
Camera ready print quality secrets (like this ad, ferinstance) 
New and mind-blowing WPL routines you simply won't believe 
Self-Prompting (!) glossaries for Diablo, Epson, many others 
Includes a free "must have" bonus book and helpline service 


All this and bunches more for only $39.95. Everything is unlocked and 
unprotected. Order from SYNERGETICS, 746 First Street, Box 809-AAL, 
Thatcher, AZ, 85552. (602) 428-4073. VISA or MC accepted. 
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Speed Vs. SPACC 4-0 6 6:66 6.660 0.0 06 SO Cs. 6 een oe BOD Sander-Cederlof 


There are always tradeoffs. If you have plenty of memory, you 
can write faster code. If you have plenty of time, you can 
write smaller code. In an “academic”™ situation you may have 
plenty of both, so you can write “creative” code, stretching 
the frontiers of knowledge. Ina "real“ world it seems there 
is never enough time or memory, so projects have to be finished 
on a very short schedule, fit in a tiny ROM or RAM, and run 
like greased lightning. 


A case in point is last month's installment of the DP18 series; 
the SHIFT.MAC.RIGHT.ONE subroutine on page 8 takes about 1827 
clock cycles, and fits in 25 bytes. Upon reflection, I see a 
way to write a 34-byte version that takes only 1029 cycles. If 
I can use nine more bytes, I can shave about 800 microseconds 
off each and every multiply. (Maybe a total of a whole minute 
per day!) That might be important, or it might not; but seeing 
the two techniques side-by-side is probably valuable. 


1970 SHIFT.MAC.RIGHT.ONE 


1980 LDY #4 4 BITS RIGHT 
1990 .0 LDX #1 20 BYTES 

2000 LSR MAC 

2010 .1 ROR MAC ,X 

2020 INX NEXT BYTE 

2030 PHP 

2040 CPX #20 

2050 BCS .2 NO MORE BYTES 
2060 PLP 

2070 JMP .l 

2080 .2 PLP 

2090 DEY NEXT BIT 

2100 BNE .0 

2110 RTS 

1970 SHIFT.MAC.RIGHT.ONE 

1980 LDX #0 FOR X=0 TO 19 
1990 TXA NEW 1ST NYBBLE = 0 
2000 .1 STA TEMP SAVE FOR HI NYBBLE 
2010 LDA MAC,X MOVE LOW NYBBLE 
2020 ASL TO HI SIDE 
2030 ASL 

2040 ASL 

2050 ASL 

2060 PHA SAVE ON STACK 
2070 LDA MAC,X MOVE HI NYBBLE 
2080 LSR TO LOW SIDE 
2090 LSR 

2100 LSR 

2110 LSR 

2120 ORA TEMP MERGE WITH NEW 
2130 STA MAC ,X HI NYBBLE 
2140 PLA HI NYBBLE OF NEXT BYTE 
2150 INX NEXT X 

2160 CPX #20 

2170 BCC .l 

2180 RTS 
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The smaller method uses two nested loops. The inner loop 
shifts all 20 bytes of MAC right one bit. The outer loop does 
the inner loop four times. If I counted cycles correctly, the 
time is 4*(19*23+18)+7. The faster method uses one loop to 
scan through the twenty bytes one time. The timing works out 
as 20*51+9. 


Upon still further reflection, it dawned on me that a 38 byte 
version could run in 840 cycles! This version processes the 
bytes from right to left instead of left to right; eliminates 
the PHA-PLA and STA-ORA TEMP of the second version above; and 
loops only 19 times rather than 20. The timing is 19*43+23. 


1970 SHIFT.MAC.RIGHT.ONE 


1980 LDX #19 FOR X = 19 TO 1 STEP -1 
1990 .1 LDA MAC ,X SHIFT HI- TO LO- 

2000 LSR 

2010 LSR 

2020 LSR 

2030 LSR 

2040 STA MAC ,X SAVE IN FORM OX 

2050 LDA MAC-1,X GET LO- OF HIGHER BYTE 
2060 ASL 

2070 ASL 

2080 ASL 

2090 ASL 

2100 ORA MAC ,X MERGE THE NYBBLES 

2110 STA MAC ,X 

2120 DEX NEXT X 

2130 BNE .l -- UNTIL 0 

2140 LDA MAC PROCESS HIGHEST BYTE 
2150 LSR INTRODUCE LEADING ZERO 
2160 LSR 

2170 LSR 

2180 LSR 

2190 STA MAC 

2200 RTS 


Of course an even faster approach is to emulate the loops I 
wrote for shifting 10-bytes left or right 4-bits. The program 
would look like this: 


1970 SHIFT.MAC.RIGHT.ONE 


1980 LDY #4 
1990 .1 LSR MAC 
2000 LSR MAC+1 
2180 LSR MAC+19 
2190 DEY 

2200 BNE .l 
2210 RTS 


This version takes 2+3*20+4 = 66 bytes. Yet the timing is only 
(4*6+5) *20+7 = 587 clock cycles. And by writing out the four 
loops all the way, we use 4*3*20 = 240 bytes; the time would be 
4*6*20 or 480 cycles. 
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How about another example? The MULTIPLY.ARG.BY.N subroutine on 
the same page last month was nice and short, but very slow. 

The subroutine is called once for each non-zero digit in the 
multiplier, or up to 20 times. What it does is add the 
multiplicand to MAC the number of times corresponding to the 
current multplier digit. If we assume the distribution of 
digits is random, with equal probablility for any digit 1...9 
in any position, the average number of adds will be 5. 

Actually there will be zero digits too, so the average will be 
4.5 instead of 5, with the subroutine not even being called for 
zero digits. 


For 20 digits, 4.5 addition loops per digit, that is an average 
of 90 addition loops. And a maximum, when all digits are 9, of 
180 addition loops. 


Now, if there is enough RAM around, we can pre-calculate all 
partial products from 1 to 9 of the multiplicand and save them 
in a buffer area. Each partial product will take 1l bytes. We 
already have the first one in ARG, so for 2...9 we will need 
8*11 or 88 bytes of storage. It will take 8 addition loops to 
form these partial products. Once they are all stored, the 
MULTIPLY.ARG.BY.N subroutine will always do exactly one 
addition loop no matter what the non-zero digit is. Therefore 
the maximum number of addition loops is 8+20 or 28, compared to 
180! And the average (assuming there will be 2 zero digits out 
of 20 on the average) will be 26 addition loops. 


The inner loop in MULTIPLY.ARG.BY.N, called “addition loop” 
above, takes 20 cycles. If we implement this new method, we 
will have shortened the average case from 1800 to 520 cycles, 
and the maximum from 3600 to 560 cycles. Of course the whole 
DMULT routine includes more time-consuming code, but this 
subroutine was the biggest factor. Taking the 
SHIFT.MAC.RIGHT.ONE improvements also, we have shortened the 
overall time in the average case by 2078 cycles, or 2 
milliseconds per multiply. In the maximum case, the savings is 
nearly 4 milliseconds. 


Of course, it takes more code space as well as the 88-byte 
partial product buffer for the new method. And it will take 
more time to write such a program. You have to make tradeoffs. 
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